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System  Modeling 


Results  of 
Physical 
Experiments 
&  Device 
Modeling 


WHEN  systems  fail,  HOW  do  they  fail? 

We  use  this  information 
to  develop  &  test  reliable  architectures 
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Tasks  (Talk  Outline) 

ACCOMPLISHED 

•  Verilog  models  of  two  bootable 
microprocessors  (simple/advanced) 

•  Initial  work  on  robust  architectures 
(reliability  studies,  draft  floorplan) 

IN  PROGRESS 

•  Verilog  model  of  microprocessor  with 
hardware  checkpoint/repair 

FUTURE  WORK 

•  Fabricate  and  test  physical  designs 

•  Enhanced  Verilog  and  SPICE  software 
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Processor  Model:  simple 


Single-Issue,  In-Order,  Five-Stage  Pipeline 
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Processor  Model:  simple 

Single-Issue,  In-Order,  Five-Stage  Pipeline 


16-bit  processor  core 
Handles  interrupts  precisely 
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Processor  Model:  advanced 

High-Performance  Out-of-Order  Core 
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Processor  Model:  advanced 

PowerPC  ISA  with  AltiVec  SIMD  Unit 


Design  &  layout  of  core  in  progress 
(AltiVec  multiplier  above) 
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Robust  Architectures 


INITIAL  STUDY:  Assume  the  Worst  Case 
Checkpoint/Repair ... 


FAULT 

DETECT 


TIME 


A 

Resume  Execution 

State  i-2  State  i-1  State  i  I  from  STATE  i 


CHECKPOINT: 

Periodically  save  KNOWN-GOOD  STATE 
In  case  of  FAULT,  REPAIR  to  last  saved  state 
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Robust  Architectures 


INITIAL  STUDY:  Assume  the  Worst  Case 


ECC  encoding 

Implementation  can  be  single-  or  multi-chip 
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Robust  Architectures 


Success 

rate 
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Robust  Architectures 


Initial  Floorplan  (simple  CPU  +  safestore) 
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Globally  Asynchronous 

ASSUMPTION:  Clock  Net  is  Weak  Point 
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Processing  elements  generate  local  clocks 
Synchronize  at  inter-node  communication 
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Future  Work 

Synthesize,  Fabricate,  Test  Canonical  Forms 

•  Clock  trees 

•  Busses  (w/  and  w/o  ECC) 

•  Memories  (w/  and  w/o  ECC) 

•  Arithmetic/Logic  Units 

Incorporate  Drift-Diffusion  into  CAD  tools 
(more  accurate  SPICE/Verilog  modeling) 
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